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ABSTRACT 



A method for a computation agent to acquire a queue lock 
in a multiprocessor system that prevents deadlock betwee n 
t he computat ion agent and eytema| iptftr nipts. The method 
provides for the rnmpujatipn agent to join^ygu^aej o acquire 
a_locj^ Next, upon r^eceiying^ \smersliir^c^ the 
computation age nt raises its.p qoril\UeveLto,aiiighec.second 
priority level. In res pons fc to a receipt. of_an.e j xternaljntec rupt| 
havi ng a h i^he r r prio rity- level jj.ccurri n g ( be fo re the_co mpu 
taUonjige nt basj aised.its,priorityJeyeLto,the.se,cond.higher 
priopty.level, the.computatipn agentxehpfl uishes ownership 
of the lock. Subsequent to raising its priority level to th 
second mgher priority level, the computation agent deter- 
mines if it still has ownership of the lock. If the computation 
agent determines that it has not acquired possession of the 
lock after raising its priority level, the computation agent 
rejoins the queue to reacquire the lock. In one embodiment 
of the present invention, the computation agent's priority 
level is restored to its original, i.e., first priority level, when 
it rejoins the queue to reacquire the lock, 

26 Claims, 2 Drawing Sheets 
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SCALABLE INTERRUPTIBLE QUEUE 
LOCKS FOR SHARED-MEMORY 
MULTIPROCESSOR 

BACKGROUND OF THE INVENTION 

1. Technical Field of the Invention 

The present invention relates generally to data processing 
and in particular to shared-memory multiprocessors. Still 

more particularly, th ^present invention rentes tn srjilflhlp. 

TLinterruplible queue locksJar a shared-memory: multiproce s- 
sor and a method of operation thereof. . 

2. Description of the Related Art 

Operating system kernels require efficient locking primi- 
tives to enforce serialization. Spin locks and queue locks are 
two common serialization mechanisms. In addition to scal- 
ability and efficiency, interruptabilitv *is a desired trait. 
Because of atomicity requirements, a thread may have to 
raise its priority level before entering a critical section that 
manipulates memory. Additionally enahlinpjhe thread to be 
i nterrupted while it is wai ting for th r. lork increases th e 
r esponsiveness of the syste m trr int A rniptfi 

A spin lock is a simple construct that uses the_sache 
cohere nce mec haniani in a multiprocessor system to control 
access^ip^^SieaLaectio n. A typical spin lock implemen- 
tation has two phases. In the spin phase, the waiting com- 
putation agents, or threads, spin on a cached copA LaLa single 
gloB^no^vanable. It should be noted that in the context of 
operating system(OS) kernels, there is generally a one-to- 
one^ corr espond en ce be tween computation a gents , or 
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threads; and processors.'IfTthe compete phase, the waiting 
computation agents all try to atomically modify the lock 
variable from the available to the held state. The one 
computation agent that succeeds in this phase has control of 
the lock; the others go back to the spin phase. The transition 35 
from the spin to the compete phase is initiated when the lock 
holder releases the lock by marking the lock variable as 
available. 

Spi n locks ha yetwo main advantages: they req uire only 
a few instructions to implement and thev ar e easily designed ^ 
to be internip tihle. The main disadvantage of spin locks is 
that they do not scale well. The compete phase can cause 
significant contention on the system buses when a large 
number of computation agents simultaneously attempt to 
acquire the lock. Spin locks are thus suitable only for lightly 45 
contended locks. In addition, since the lock is not necessar- 
ily granted in first in first out (FIFO) order, spin locks are 
typically not fair. 

In a^q ueue lock, com piitatioj L agents queue up to acqui re 
t he lock. TV lor-lf hnMp.r releas e the lock by granting it to 50 
a c omputation ag epta t the he_ ari nf the, mmue. If there are no 
computation agents in the queue, the lock is simply- marked 
available to the next c omputa tion agent th at tries to acqu ire 
^.Queue lock implementations typically involve two 
phases. In the enqueue phase, a cojapjUaiioji_ageatJoins the 55 
queue byatomicajlvu pjatin g the q ueue datasl nicture. In the 
spin phase, qu eued comp^ utatipjiagejts^piD^ 
lock to be granted. In contrast to spin locks, the computation 
al^nts~in~nju mie lock sains , at .a^Stjjffi memorvlocations 
that typically map to disJ Lmct_&EQfi3iags. ^"TocTcnolder 60 
nolifies~that the lock is available by updating a single 
computation agents spin location. Since the computation 
agents spin on distinct memory locations, the lock holder 
wakes up only one computation agent when it releases the 
lock. 65 

Queue locks are generally more complicated to imple- 
ment than spin locks. Their main advantage is that they scale 



well. Unlike spin locks where a lock release causes a 
free-for-all among the waiting computation agents, at most 
one computation agent is woken up in a queue lock. This 
makes them particularly suited for heavily contended locks. 
Queue locks can enforce fairness by having the queue data 
structures preserve the order in which computation agents 
enqueue themselves. The main disadvantage of queue locks 
over spin locks is the increased number of memory opera- 
tions caused by the enqueue-spin-wakeup cycle. For lightly 
contended locks, these extra operations can significantly 
increase the time it takes to acquire a lock. 

A n^interrup tibje j pck js one that can handle interrup ts 
be tween. me_tim_e w hen_a,com putation a g ent expresses a 
de sire to a c^mre_aiock and the time that it actually acquire s 
it. Depending on the contention for a lock and the time spent 
inside the critical section it controls, a significant period of 
time may elapse between when a computation agent begins 
the process of acquiring the lock and when-iLfinaJly^gets 
control. In order to~pFeserve atomicity of accesses to key 
data structures, the system may enforce the restriction that 
the critical section be entered only at a h igh interrupt'reque st 
level (IRQL). The priority level is typically an attnbuteof 
the operating environment wherein only interrupts above a 
certain priority will be serviced. On many systems, this is 
referred to as the IRQL. The ideal procedure for handling 
this situation is to have a computation agent wait for the lock 
at a lower IRQL and raise the IRQL only after the compu- 
tation agent gets control of the lock. 

TABLE 1 

Spin lock 

begin: while (lockvar held); 

[temp - lockvar; lockvar « heldl^rojmc 
if (temp available) 
return 

else 

goto begin 



Consider the spin lock algorithm shown above in Table 1. 
Table 2 below illustrates a mechanism for incorporating 
interruptability in this algorithm. The idea is to raise the 
IRQL just prior to entering the compete phase. If the attempt 
is successful, the lock would have been acquired at the 
higher IRQL. If the attempt fails, i.e., another computation 
agent has acquired the lock, the IRQL is restored to its 
original level. This ensures that any interrupt that would 
have been serviced at the original IRQL will continue to be 
serviced while the computation agent waits for the lock in 
the spin phase. 

TABLE 2 
[nterruptible spin lock 

oldirql - GetIRQL( ); 
begin: while (lockvar held); 
SetlRQL (HIGH); 

[temp - lockvar; lockvar = held] ATOMIC 
if (temp =- available) 

return 
else { 

SetlRQL (oldirql); 

goto begin; 

> 

Making queue locks interruptible is not as simple as in the 
case of spin locks. The problem lies in the fact that in a 
queue lock, a computation agent can be granted the lock at 



04/12/2004, EAST Version: 1.4.1 



US 6,473,819 Bl 

3 4 

any time after it joins the queue. Contrast this with a spin In yet another embodiment of the present invention, the 

lock where a computation agent knows that it will not get computation agent relin qyishes ownershj p_of the lock by . 

c ontrol of the lock unless it initiates the compete phase. The relea sing the lock to a second jxmputajion aggn l. In a related • ^ 

straightforward approach of raising the IRQL after a queue embodiment, the second computation agent is the^n^xt v >o0 

lock has been acquired creates a window of vulnerability 5 com puufon^agenr wbose'blt ^sitipjiinJbe-^ e b al bitma s k V u 

between when the lock is acquired and the IRQL is raised. (thafTracks requests forthe lock) is to the TeTTof the bit 

During this transition period, a deadlock co ndition-may position on the left of the first computation agent, which has 

occur . For instance, consider a low priority level external ownership of the lock, in the global bitmask that wants 

interrupt that occurs within this window. Furthermore, in the ownership of the lock. It should be readily apparent to those 

process of servicing this interrupt, let it be necessary to 10 skilled in the art that, in other advantageous embodiments, 

acquire the queue lock under consideration. The deadlock t he .ownership of the lock may be __pa&sed-4o— the— next 

situation is now apparent. In order for the interrupt service computation agent on the right of the first com putatian agent 

handler to obtain the lock, the lock needs to be released. i hat - indicate s— a -_desire to acquire the lock . The present 

However, in order for the lock to be released, the interrupt invention does not intend to limit its practice to any one 

service handler must finish, enabling the original lock 15 particular directi on bu t requires that which ever direction 

acquire-re lease cycle to complete. Thus, a more sophisti- (right or left) is gelecteft be maintained to prevent st arvation 

cated mechanism is required. by eventually granting the lock to every wa iting .compu ta- 
ti on age nt. Furthermore, in_non-uniform memory (NUMA) 

SUMMARY OF THE INVENTION systemTthe Jock holder may give preccde nccUo.computation 

It is therefore an object of the present invention to provide 20 agents that reside on the same node as itself, taking care to 

an improved^ultiproc^Vor systenO avoid starvation by not doing "this too many times in a row. 

It is another object of the present invention to provide The foregoing description has outlined, rather broadly, 

scalable interruptibl e q ueue locks for shared-memory mu l- preferred and alternative features of the present invention so 

tiprocessors and alnetE od of operation thereof. that those skilled in the art may better understand the 

To achieve the foregoing objects, and in accordance with 25 detailed description of the invention that follows. Additional 

the invention as embodied and broadly described herein, a features of the invention will be described hereinafter that 

method for a computation agent-t o acquire a queueJa ckJn f° rm tne subject matter of the claims of the invention. Those 

a m^processor.system that pre vents deadlock betwe en the skilled in the art should appreciate that they can readily use 

computation agent and ex^ejnaj/Suej^Js^ is disclosed. The 3Q the disclosed conception and specific embodiment as a basis 

method provides" forthe' computa tion agen t to ioin_ a_oueue for designing or modifying other structures for carrying out 

to acquire_aJock. Next, u^on receiving ownershipof the the same purposes of the present invention. Those skilled in 

lock, the ^omputaiion^gent rais ^ns^io1rtt\n ey, e_J 1 "lo" a the art should also realize that such equivalent constructions 

jjgt^second.priori t ^ an do not depart from the spirit and scope of the invention in its 

^ eternal Interrupt having a higher priority ley_e Loccucring 3S broadest form. 

iefore the comp utation ageTir nas raised its priority level to 

5T"seco na~figier priorit7"leveirih^Ic gm^{a^n^ent BRIEF DESCRIPTION OF THE DRAWINGS 
rdmqmshes-owr}^ to raising its 

pnoFity level to the second higher priority level, the com- For a morc complete understanding of the present 

putation agent deter mines if it_ stUl-Jias-0^erslun^ofahe invention, reference is now made to the following descrip- 

lock. If the computation agent determines that it has not | 10ns laken in conjunction with the accompanying drawings, 

'-acquired possxssiori^fMhe4o^^ftei;raisinfi its priority le vel, m which: 

the computation agenUgjgja&Jhg^gu eue jo ^reacquire _th e FIG. 1 illustrates an embodiment of a multiprocessor 

lock- system that provides a suitable environment for the practice 

~~Tne present invention introduces a novel methodology of 45 of the present invention; and 

implementing q ueue locks that allows for Merrup tagt& nG. 2 illustrates a high-level flow diagram of an embodi- 

fromexte^nalmierrupis wniie elimm^g^y_d^dJock men , of m mterruptible queue lock acquiring process using 

c^Sm-Tte present invention accomplishes this by ^ rinci les b ^ nt invention . 
permitting the com puta tion agent that has been g iven ow n- 

ersju££fjJockJab^ 50 DETAILED DESCRIPTION 
waitin g commutation a g ent when an int ervening interrupt is 

encountejeddum^^ Wilh re f er ence now to the figures, and in particular, with 

priority level. reference to FIG. 1, there is depicted an embodiment of a 

In one embodiment of the present invention, the compu- multiprocessor system lQ flLthat provides a suitable environ- 

tation agent's priority level is restored to its original, i.e., 55 ment for the practice ofthe present invention. Multiproces- 

first priority level, when it rejoins the queue lo reacquire the so r system 100 includes ^rst^ s econd and third pr ocessors 

lock: In a related embodiment, the second priority level of UOo^UPl). 110c t^at are cou pled tn a plurality ^£ memory ; 

the computation agent is higher than the priority level of the £^r^^ y H P!S T£nVf H l^-A", l^/llOryda-a^y^ em bus 11 5. 

external interrupt. Also shown coupled to system bus 115 is a bridge 130, such 

In another embodiment of the present invention, the 60 as a PCI bridge, that couples the plurality of processors 

queue is implemented using a global bitmask wherein the HOa-llOc and memory 120a-120c to first, second, third 

number of bits in the global bitmask is emiaLt p the nu mber and fourth devices 140, 150, 160, 170. In the illustrated 

of compu ta^n_agents-m-the_multiprocessor-system. It embodiment, the fourth device 170 is a network bridge that 

should be noted that, in other advantageous embodiments, connects the multiprocessor system 100 to other systems in 

the-numher of hits in Jh e global bit mask is greater than th e 65 a local area network (LAN) or, alternatively, in a wide area 

mimher of p r ocess ors lnthe muitiproc essorsystem allowing network (WAN) configuration. First, second and third 

for^dabiUty_inJne^ysteED. ~ " ™ devices 140, 150, 160, in an advantageous embodiment, are 
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sensors, actuators and network controllers that are part of a 
real-time system that provide external interrupts to first, 
second and third processors 110a, 110b, UOc. In an advan- 
tageous embodiment, each processor is dedicated to service 
It he interrupts generated bv a specific or a ^roup of specific* 

As discussed previously, a problem arises in deciding 
whether the mteminLpr ipritv level should he -f ais e d - first er 
the lock be acquis first. When the interru pl p riority lev el 
isr aised only after Unlock acquire succeeds, lower priority 
interrupts may be serviced after a comp utation agent, i.e ., 
processo r, has acquired the-lock. Therefore, the condition 
that lowe^pjjority interru pts should be inhibitedj wjiile a 
co mjuU atipn agents holds a lock cannot be satisfie d. 
Conversely, if the lock acquire is attempted after raising the 
interrupt priority level, lower priority interrupts will not be 
serviced while the computation agent is waiting for the lock. 
Since a significant amount of time may elapse before a 
waiting computation agent acquires the lock, the respon- 
siveness of the system to lower priority interrupts decreases. 

Referring now to FIG. 2, there is illustrated a high-level 
flow diagram of an embodiment of an intemiptible queu e 
l ock acquiring process JS00 using the principles_disclosed by 
the present invention. Process 200 is initiated when a 
computation agent joins a queue, as depicted in step 210. 

he queue, in an advantageous embodiment, is a single 
gl obal bitmask that keeps track of the computation agent that 
^curr ently holdin g, i.e., having possession ^ theTock and 
be ^mputation-age^ts^mat_are _waitm gror tne log K: It 
should be noted that the number of bits in the global bitmask 
should be at least equivalent to the number of computation 
agents employed in a system. For example, for the system 
100 depicted in FIG. 1, since there are three pr ocessor^ 
there sho uld be at least three bits inthe globa l bitmask . It 
shoukfbe rejdUy^apparentJo those skilled in the art that, in 
other advantageous embodiments, the number of bits in the 
global bitmas k is greater than _the^number-of-{>rocessoFs in 
the m ultiprocessor s ystem allowing for scalability in the 
system^ 

Each computation agent has a distinct spin location, also 
called a "slot," where iL spinsjyhUe^waitmgjfQr Jhe loeien o * 
be-g ranteftlQ_i L. In an advantageous emb odimen t th e_slot 
asso ciated with each computation^a gent is locate d Igcallyjo 
its correspondin g ^) rocessor . Additionally, computation 
agents may also have some local, i.e., private, state that they 
use to co mmun i cate with _ their_intemiP.tJiandlers. The 
acquire operation is first described. An exemplary pseudo- 
code entitled Acquire 0 detailing the acquire operation is 
listed below in Table 3. 

TABLE 3 

Lock Acquire 

Initial Values: oldmask-lNVALlD, INT_HAPPENED«FALSE 
Acquire (ncwIRQL) 
{ 

retry: disable_intcrnipts 

[old mask - MASK; MASK !- mypid] 
Slot - WANT 
INT__HAPP ENED » FALSE 
enable_Jnterrupts 
if (oldmask != 0) 

while (slot — WANT && 

INT_HAPPENED — FALSE) ; 
oldirql = SetlRQL (ncwIRQL) 
oldmask » INVALID 



50 
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60 



65 



TABLE 3-continued 



Lock Acquire 



10 



if (INT-HAPPENED — TRUE) { 
sctlRQL (oldirql) 
INT__HAP PENED - FALSE 
goto retry 

} 

I NT_ HAPPENED = TRUE 
slot - GOTTT 
return oldirql 
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I n order to acquire a lock, the . com p utation agent first I 
dis ables intermp tsand atomic^y^btains^the.existin£^,alue \ 
ofthe global rMtmask andsets itsjbit iiyhe.glpbaLbitmask. 
The computation agent also preserves the existing value of 
the global bitmask (referred to as the captured bitmask). This 
will, in turn, indicate to the computation agent V interrup t 
hand lcr_of the comp utatioii_agenL.intenli on to acqu ire the 
lock. Subsequently, with interru pts stiLL disabled. the_com - 
putaii on ag ejiUrutjalizesJis siol.toilWANT, i " i.e., i nforming 
anyonereleasjn g the lock that _th e_com putatio n-agent is 
pre pared toJa ke^ pwnership^ At this time, it is safe to accept 
interrupts and consequently, the computation agent enables 
them. Following the enablement of the interrupts, the com- 
putation agen t checks tQ^e.temiine^if J.Uhas-been^gjajited 
ownership of the lock or have been interrupted ,C5pinning 
un til one ojjhesejcon dito " 

After receiving ownership of the lock, as illustrated in 
step 220, the computation agent eliminates interference from 
interrupts by raising its priority level, i.e., IRQL, to a second 
priority level required by the lock as depicted in step 230. 
The second priority level is typically greater, or higher, than 
the priority levels of any external interrupts. 

Next, during the transitional period when the computation 
agent is raising its priority level to a higher level, process 
200 inquires if an intervening interrupt has been received, as 
illustrated in decisional step 240. If it is determined at 
decisional step 240 that the acquiring computation agent has 
raised its priority level to the second higher priority level, 
process 200 continues without further action to decisional 
step 260. 

At this point in the process, the computation agent sets its 
captured global mask value to "INVALID", allowing inter- 
rupts to still occur, but directing the interrupt handler not to 
take any action. Since jthe cojn putatiQn.agentls.priority-level 
js-aiOheJocki s IRQ L, no action .oe eds to bei akenbv the 
intexnipLha ndler . At this point, it can safely determined if an 
intervening interrupt has occurred during the transition 
period. whe n the com putation agent first receives ownership 
ofjhgjgckand when it^a^ ^)i^^io^y^lev el to the^se cond 
higher priority leve^ asHIustrated in decisional step 260. 
Since in this case, no intervening interrupt occurred during 
the transition period, the computation agent now^has pos- 
session of the lock and it returnsjts^origmai^5rity_leyel to 
its caller, as illustrated in step 270. 

On the other hand, if the determination at decisional step 
240 is that an intervening interrupt was received, the com- 
putation agent, as depicted in step 250, reHn^uisheaJiwjier- 
ship of _the_lock. An exemplary pseudo-code entitled 
ReleaseQ for operations performed at a lock release is listed 
below in Table 4. 
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TABLE 4 



Lock Release 



Release ( ) 
{ 

1. slot - DONTWANT 

2. retry; [if ((oldmask - MASK) — mymask) MASK - 0] 

3. if (oldmask **** mymask) 

4. return; 

5. candidate = Chosen thread from MASK 

6. cslot = candidate's slot address 

7. [if ((oldcslol - -cslot) — WANT) 'cslot - GOTTT] 

8. if (oldcslot ~ WANT) 

9. MASK &- Imypid 
return 



10. 
} 



} 

else goto retry 
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The compjuUuon-agenUeleas^theJo^^ its 
s^oJJoJJIQl^nm has the effect of preventing any 2 o 

iuTure computation agent that may end up using a cached 
and invalid value of the global bitmask from assigning it 
back to the releasing computation agent. Next, the compu- 
tation agent atomically reads the global bitmask, clearing its 
associated bit to zero if it is set in the bitmask. It should be 
noted that if the global bitmask is zero, indicating that there 
are no computation agents_waiting_ for the lo ck, the release 
operation is done. 

On the other hand, if there are computation agents waiting 
to acquire the lock, the computation.agen lreleasin gjhe lock 30 
mus£sele3 onej)f.tbejvvaiting computatlo ii agents and grant 
it_t he lock.. Since a bitmask.d ae^uJio^preser ve^ e^oTder in 
which^e_rornputation agents have q ueuedjt p.forjhe lo_ckr 
the currentJocK[^ld^lm^t„actively choose o ne, o f the 
waj^g_compmajion"jgents and gFant irine~lock . The 35 
present invention, in a preferrecI"ernHocliment, requires the 
lock^h^lder^j o examine the bjtoa sloand^selectjhe^first 
computoUon.agelTOs^ted.toats JeTfin" the bit mask. It should 
be notecTthat this^selertioirsc^ 

( nFQ )_si nce^it .giYcs^pr ecedcnce_ to _a computation ag ent 40 
closer to.meJockJ^^r.(gr^it^left) u This scheme, however, 
prevents starvation.b.y: eyenmallv^ gran^g ^JieJa ck to every 
waitmg-computation.agent. Furthermore, fry~giving~prece- 
de^celo^computatian~agent closer to the lock holder than 
a computation agent farther away, the present invention may 45 
r educe memory tfa jSgSn lar ge non-uniform memory acc ess 
(NUMA) system sTTTsnould be readily apparent to those 
skilled in the art that, in other advantageous embodiments, 
the owners hip of the lock may be passed to_ the__pgxt 
commitMio^aieni on the right of the first com pulation agent 50 
that indicates a _ desire to acquire the lock. The present 
invention does not intend to limit its practice to any one 
particular direction but requires that which ever direction 
(right or left) is selected be maintained to prevent starvation 
by eventually granting the lock to every waiting computa- 55 
tion agent. Furthermore, in NUMA systems, the lock holder 
may give precedence to computation a geqts that resid e on 
the same node as itself, taking care to avoid starvation by not 
doing this too many4imes_in a row. 

In those circumstances where there is a computation agent 60 
waiting to acquire the lock, the slot loc ation on which the 
waitin g computati on ajzent is spinning i s first computed . 
This~can be easily accomplished since each computation 
agent has a distinct slot on which it spins. Next, the slot 
value is atomically captured and examined. If the slot value 65 
was "WANT, " i.e., indicatin g that iLwanis t he lock, the slot 
value is then changed to "GOTIT." Furthermore if the slot 



value is "WANT/' the handoff is complete and the releasing 
computation agent is now free WJjirn off it^ hi* in lh p 
bit mask and jeittrn. It should be noted that a value other than 
"WANT" indicates that the chosen compulation agent has 
either been interrupted or that a "stale" version of the 
bitmask has been examined. If this occurs, the releasing 
computation agent selects another waiting computation 
agent by reexamining the bitmask. Following the relinquish- 
ing of the lock to a second computation agent, process 200 
proceeds to decisional step 260. Since in this case, an 
intervening interrupt has occurred, the computation agent 
does not have ownership of the lock. Therefore it needs to 
re-register for the lock. In order to accomplish this, the 
computation age nt proceeds to restore its priority level tojts 
orig inal value , as depicted in step 280, andr eioins the queu e. 

The interrupt handler actions are reTallvely straightfor- 
ward. An exemplary pseudo-code for the actions to be taken 
prior to actually servicing the interrupt entitle InterruptAc- 
tion ( ) is listed below in Table 5. 

TABLE 5 
Interrupt Handler Action 

InterruptAction ( ) 
{ 



} 



1. if (oldmask — INVALID) 

2. return 

3. [if (oldslot - slot) ~WANT; slot « 

4. INT_HAPPENED - TRUE 

5. if (oldslot — GOTTT !! 

6. (oldmask = 0 && 

7. oldslot — WANT) ) { 

8. Release ( ) 
return; 

9. } else 

10. MASK &- Imypid 



DONTWANT] 



Prior to executing the interrupt code that will service the 
received interrupt, the interrupt handler first checks to deter- 
mine if the computation agent is queued waiting for the lock. 
If the computation agent is not in the queue, the interrupt 
handler proceeds to execute the interrupt code. On the other 
hand, if the computation agent is in the queue, the interrupt 
handler atomically obtains the computation agent's local slot 
value and, if the value is "WANT," changes it to "DONT- 
WANT/' This will indicate to any potential releaser of the 
lock that the computation agent is, at least temporarily, not 
interested in acquiring ownership of the lock. Furthermore, 
the interrupt handler must determine whether or not the 
interrupted computation agent has already been granted 
ownership of the lock or will assume ownership of the lock, 
i.e., there were no other waiting computation-agents wanting 
thelock^If so, the lock must be released, e.g.. by callin g 
R elease( ) described abov e, that _will pass ownership of th e 
lpjgkjoJhe-B OKt waiun gx r^putation^g e^ The 
interrupt handler can assume that the interrupted comp uta- 
tionji gent has be ejQ_gcflnted the- Jock if the captured value of 
thft_g)nt is "nQjrr ** The interrupt handler can determine if 
the interrupted computation agent will assume ownership of 
the lock by checking the captured value of the mask. In the 
event that the bitmask is "0," the computation agent can 
assume that it was the only requestor for the lock and that 
this is the first interrupt since that event. In those conditions 
where the computation agent has not been assigned the lock, 
the interrupt handler will remove the computation agent's bit 
from the bitmask field. The Release code detailed above in 
Table 4 depends on this being eventually accomplished 
because it will keep checking the bitmask until it finds a 
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compulation agent to which it can grant the lock to, or until 
there are no computation agents waiting for the lock. 

The interrupt handler and lock release operations 
described above are defined to close a second window of 
vulnerability. For example, consider a computation agent 
that has queued for the lock and is servicing an interrupt. If 
a current lock holder grants it the lock, the responsibility for 
passing the lock to another computation agent falls upon the 
interrupt handler. If the interrupt handler succeeds in updat- 
ing the computation agent's slot value to "DONTWANT," 
the current lock holder then selects a different computation 
agent to which it will grant the lock to. Consequently, a 
computation agent that takes an interrupt leaves the queue 
and only rejoins the queue after the interrupt has been 
serviced. 

It should be noted that although the present invention uses 
state variables such as oldmask and INT__HAPPENED, the 
information conveyed by these variables to the interrupt 
handler could be conveyed through the slot variable. In 
addition, it should also be apparent to those skilled in the art 
that the scheme outlined above for jtcquirino lorfr rmilrl hp 
easilvmodified to a conditional lock acquire wherein t he 
lodcjsjacquired only it' it is free. Such a conditional lock 
acq uire scheme h as many uses: Furthermore, it should be 
noted that although the present invention has been described 
in the context of a computer system, those skilled in the art 
will readily appreciate that the present invention is also 
capable of being distributed as a computer program product 
in a variety of forms; the present invention does not con- 
template limiting its practice to any particular type of 
signal-bearing media, i.e., compu tcr_i&adabl e medium^ti- 
lized to actually carry out the distribution. Examples of 
signal-bearing media includes recordable type media, such 
as floppy disks and hard disk drives, and transmission type 
media such as digital and analog communication links. 

In a preferred embodiment, the present invention is imple- 
mented in a multiprocessor system programmed to execute 
the method described herein. Accordingly, in an advanta- 
geous embodiment, sets of instructions for executing the 
method disclosed herein are resident in RAM of one or more 
of processors configured generally as described herein- 
above. Until required by the jnultiprocessor system, the set 
of instructions may be stored as com puter pro gram produ ct 
in anolhe«omput er_memory r ^ .g.. a disk drive. In another 
advantageous embodiment, the computer program product 
may also be stored at another computer and transmitted to a 
user's computer system by an internal or external commu- 
nication network, e.g., LAN or WAN, respectively. 

The present invention may be embodied in other specific 
forms without departing from its spirit or essential charac- 
teristics. The described embodiments are to be considered in 
all respects as illustrative and not restrictive. The scope of 
the invention is, therefore, indicated by the appended claims 
rather than by the foregoing description. All changes which 
come within the meaning and range of equivalency of the 
claims are to be embraced within their scope. 

What is claimed is: 

1. A method for a computation agent to acquire a queue 
lock in a multiprocessor system that prevents deadlock 
between said computation agent and external interrupts, 
comprising the steps of: 

(a) joining a queue to acquire a lock; 

(b) receiving ownership of said lock; 

(c) raising a first priority level of said computation agent 
to a higher second priority level in response to receiv- 
ing ownership of said lock; 
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(d) relinquishing ownership of said lock in response to a 
receipt of an external interrupt having a higher priority 
level, wherein said external interrupt occurs prior to 
said computation agent raising its priority level to said 
second priority level; 

(e) determining if said computation agent has acquired 
possession of said lock after it has raised its priority 
level; and 

(f) repeating step (a) in response to said determination that 
said computation agent has not acquired possession of 
said lock. 

2. The method as recited in claim 1 wherein said step of 
repeating step (a) includes the step of restoring said com- 
putation agent's priority level to said first priority level. 

3. The method as recited in claim 1 wherein said second 
priority level of said computation agent is higher than said 
priority level of said external interrupt. 

4. The method as recited in claim 1 wherein said queue is 
implemented using a global bitmask. 

5. The method as recited in claim 4 wherein the number 
of bits in said global bitmask is equal to the number of 
computation agents in said multiprocessor system. 

6. The method as recited in claim 1 wherein said step of 
joining a queue includes the step of spinning locally. 

7. The method as recited in claim 4 wherein said step of 
relinquishing ownership includes the step of releasing said 
lock to a second computation agent. 

8. The method as recited in claim 7 wherein said second 
computation agent is the next computation agent on the left 
of said computation agent in said global bitmask that is 
requesting ownership of said lock. 

9. The method as recited in claim 4 wherein said step of 
relinquishing ownership includes the step of giving prece- 
dence to a computation agent located on the same node of a 
non-uniform memory access (NUMA) system as said com- 
putation agent. 

10. A computer program product, comprising: 

a computer-readable medium having stored thereon com- 
puter executable instructions for implementing queue 
locks in a multiprocessor system that prevents deadlock 
between a computation agent and external interrupts 
when said computation agent is attempting to acquire a 
lock, said computer executable instructions when 
executed, perform the steps of: 

(a) joining a queue to acquire a lock; 

(b) receiving ownership of said lock; 

(c) raising a first priority level of said computation 
agent to a higher second priority level in response to 
receiving ownership of said lock; 

(d) relinquishing ownership of said lock in response to 
a receipt of an external interrupt having a higher 
priority level, wherein said external interrupt occurs 
prior to said computation agent raising its priority 
level to said second priority level; 

(e) determining if said computation agent has acquired 
possession of said lock after it has raised its priority 
level; and 

(f) repeating step (a) in response to said determination 
that said computation agent has not acquired posses- 
sion of said lock. 

11. The computer program product as recited in claim 10 
wherein said step of repeating step (a) includes the step of 
restoring said computation agent's priority level to said first 
priority level. 

12. The computer program product as recited in claim 10 
wherein said second priority level of said computation agent 
is higher than said priority level of said external interrupt. 
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13. The computer program product as recited in claim 10 means for relinquishing ownership of said lock in 
wherein said queue is implemented using a global bitmask. response to a receipt of an external interrupt having 

14. The computer program product as recited in claim 13 a higher priority level, wherein said external inter- 
wherein the number of bits in said global bitmask is equal to ™pt occurs prior to said computation agent raising its 
the number of computation agents in said multiprocessor 5 priority level to said second priority level; and 
system. means for determining if said computation agent has 

15. The computer program product as recited in claim 10 acquired possession of said lock after it has raised its 
wherein said step of joining a queue includes the step of priority level. 

spinning locally. 20. The multiprocessor system as recited in claim 19 

16. The computer program product as recited in claim 13 10 wherein said means for determining includes means for 
wherein said step of relinquishing ownership includes the rejoining said queue in response to said computation agent 
step of releasing said lock to a second computation agent not acquiring possession of said lock 

17. The computer program product as recited in claim 16 21. The multiprocessor system as recited in claim 19 
wherein said second computation agent is the next compu- wherein said shared memory comprises a plurality of 
tation agent on the left of said computation agent in said 15 memory. 

global bitmask that is requesting ownership of said lock. 22. The multiprocessor system as recited in claim 20 

18. The computer program product as recited in claim 13 wherein said means for rejoining said queue includes means 
wherein said step of relinquishing ownership includes the for restoring said computation agent's priority level to said 
step of giving precedence to a computation agent located on priority level. 

the same node of a non-uniform memory access (NUMA) 20 23. The multiprocessor system as recited in claim 19 

system as said computation agent. wherein said second priority level of said computation agent 

19. A multiprocessor system, comprising: is higher than said priority level of said external interrupt, 
a plurality of processors; 24 * ^ multiprocessor system as recited in claim 19 

wherein said means for joining a queue includes means for 

a s are memory, ^ spinning said computation agent locally, 

a system bus for interconnecting said shared memory and 25. The multiprocessor system as recited in claim 19 

said plurality of processors; and wherein said means for relinquishing ownership includes 

means for a computation agent to acquire a queue lock means for releasing said lock to a second computation agent, 

that prevents deadlock between said computation agent 26. The multiprocessor system as recited in claim 19 

and external interrupts, including: 30 wherein said means for relinquishing ownership includes 

means for joining a queue to acquire a lock; means for giving precedence to a computation agent located 

means for receiving ownership of said lock; on the same node of a non-uniform memory access (NUMA) 

means for raising a first priority level of said compu- system as said computation agent, 
tation agent to a higher second priority level in 

response to receiving ownership of said lock; ***** 
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